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Abstract 

We consider some generalizations of the Asymmetric Traveling Salesman Path problem. 
Suppose we have an asymmetric metric G = {V, A) with two distinguished nodes s,t G V. We 
are also given a positive integer k. The goal is to find k paths of minimum total cost from s 
to t whose union spans all nodes. We call this the fc-Pcrson Asymmetric Traveling Salesmen 
Path problem (fc-ATSPP). Our main result for fc-ATSPP is a bicriteria approximation that, 
for some parameter > 1 we may choose, finds between k and k + paths of total length 
0(&log \ V\) times the optimum value of an LP relaxation based on the Hcld-Karp relaxation for 
the Traveling Salesman problem. On one extreme this is an 0(log |y|)-approximation that uses 
up to 2k paths and on the other it is an 0(fclog |l/^|)-approximation that uses exactly k paths. 

Next, we consider the case where we have k pairs of nodes {(si, ti)}i=i- The goal is to find 
an Si — ti path for every pair such that each node of G lies on at least one of these paths. Simple 
approximation algorithms are presented for the special cases where the metric is symmetric or 
where Si = ti for each i. We also show that the problem can be approximated within a factor 
O(logn) when k = 2. On the other hand, we demonstrate that the general problem cannot be 
approximated within any bounded ratio unless P = NP. 



* Research supported by an iCORE ICT/AITF award while studying at the University of Alberta. 



Figure 1: The pictured arcs all have cost and the missing arcs have cost 1. The gap between 
optimum solutions for = 1 and = 2 is unbounded. 

1 Introduction 

We consider generalizations of the metric Traveling Salesman problem. In most of our settings we 
are given a complete directed graph G = {V, A) with n nodes. There are nonnegative arc costs 
duv,uv G A satisfying the directed triangle inequality duv < duw + dwv for any distinct u,v,w £ V. 
In general, duv / dyu ^or u,v £ V so we call such graphs asymmetric metrics. Some well-studied 
Traveling Salesman variants in asymmetric metrics are to find the minimum cost Hamiltonian cycle 
or minimum cost Hamiltonian path where some of the endpoints of the path may be specified in 
advance. All of these problems are NP-hard to approximate within small constant factors |27j . 

We will mainly study problems concerning finding multiple paths in asymmetric metrics whose 
union covers all nodes while minimizing the total cost of these paths. This generalizes the Asym- 
metric Traveling Salesman Path problem (ATSPP). Formally, define fc-ATSPP to be the following 
problem. Given an asymmetric metric G = {V, A) with arc costs duv and two distinct nodes s,t € V, 
we want to find k paths from s to i in G such that every node v £ V lies on at at least one of these 
paths. Finally, we define General A;-ATSPP to be the following generalization of fc-ATSPP. We are 
given k pairs of nodes (sj, tj), . . . , (s^, t^) in an asymmetric metric G and we are to find an Sj — ti 
path for each i so each v € V lies on at least one of these paths. Again, the goal is find such paths 
of minimum total cost. 

One thing that makes /c-ATSPP an attractive variant of ATSPP is that the gap between op- 
timum solutions for different values of k in an asymmetric metric may be arbitrarily large. For 
example, the instance in Figure [1] has a solution of cost using 2 paths but any single path has 
cost at least 1. One way to think about this is that it might be efficient to hire additional salesmen 
to cover the locations in an asymmetric metric. On the other hand, we do not have this large gap 
in symmetric metrics (i.e. when duv = dvu for all u^v £V) because a single salesman can cover all 
paths by traveling back and forth between s and t and cover all locations with no greater cost (if k 
is even then one final step from s to t makes it an s — t path while adding an extra OPT/k to the 
cost). 

1.1 Related Work 

In the well-studied Traveling Salesman problem (TSP), the goal is to find a Hamiltonian cycle of 
minimum total edge cost in a symmetric metric. A classic result of Christofides [9] is a polynomial- 
time algorithm for TSP that finds a Hamiltonian cycle with cost at most | times the cost of the 
optimum solution. Hoogeveen [TB] adapted this algorithm to the problem of finding minimum cost 
Hamiltonian paths. He obtains a |-approximation if at most one endpoint is fixed in advance and 
a |-approximation if both endpoints are fixed in advance. Recently, An, Kleinberg, and Shmoys 

have improved the approximability of the case when both endpoints are fixed to < 1.6181 [2]. 
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In asymmetric metrics, Frieze, Galbiati, and Maffioli |12j gave the first approximation algorithm 
for ATSP with an approximation ratio of log2 n where n = A series of papers improved on 
this ratio by constant factors O \20\ [TO] with the last being | log2 n. Finally, Asadpour et al. [3] 
produced an asymptotically better approximation algorithm for ATSP with ratio O (log n/ log log n). 

The variant of finding Hamiltonian paths in asymmetric metrics, namely ATSPP, has only 
recently been studied from the perspective of approximation algorithms. The first approximation 
algorithm was an 0(\/n)-approximation by Lam and Newman [22]. Following this, Chekuri and Pal 
[8] brought the ratio down to 0(log n). Finally, Feige and Singh [10] proved that an a-approximation 
for ATSP implies a (2 + e)a-approximation for ATSPP for any constant e > 0. Combining their 
result with the recent ATSP algorithm in [3] yields an O (log n/ log log n)-approximation for ATSPP. 

There is a linear programming (LP) relaxation for each of these problems based on the Held- 
Karp relaxation for TSP [T7]. For TSP, this relaxation is: 

minimize : Y^uvaE duvXuv 

such that : x{6{v)) = 2 \/ v e V 

x{d{S)) > 2 V0C5cy 

Xuv G [0, 1] y uv £ E 

Many of the approximation algorithms mentioned above also bound the integrality gap of the respec- 
tive Held-Karp LP relaxation. For TSP, Wolsey [32] proved the solutions found by Christofides' al- 
gorithm [9] are within 3/2 of the optimal solution to the above LP relaxation. For ATSP, Williamson 
|31| proved that the algorithm of Frieze et al. [12] bounds the integrality gap of its respective LP 
by log2 n. The improved 0(logn/loglogn)-approximation for ATSP in [3] improved the bound on 
gap to the same ratio. For TSP paths. An, Kleinberg and Shmoys [2] first showed that Hoogeveen's 
algorithm bounds the integrality gap of a Held-Karp type relaxation for TSP paths by | in cases 

where both endpoints are fixed. In the same paper they argue that their ^-'^^-approximation for 
this case also bounds the integrality gap by the same factor. 

Nagarajan and Ravi [26] first showed that the integrality gap of an LP relaxation for ATSPP, 
which is the same as LP ([T|) in this paper when k = 1, was 0{^/n). Later Friggstad, Salavatipour, 
and Svitkina [13] showed a bound of O(logn) in the integrality gap of this LP relaxation which 
is currently the best bound. We note that the result of Feige and Singh in [10] that relates the 
approximability of ATSP and ATSPP does not extend to their integrality gaps in any obvious way. 

In the full version of [13], the authors studied extensions of their O(logn)-approximation for 
ATSPP to /c- ATSPP. They demonstrated that fc- ATSPP can be approximated within 0{k'^ logn) 
and that this bounds the integrality gap of LP ([T|) by the same factor. Though not stated explicitly, 
their techniques can also be used to devise a bicriteria approximation for A;-ATSPP that uses 
0(A;logn) paths of total cost at most 0{klogn) times the value of LP ([T|) in a manner similar to 
the algorithm in the proof of Theorem 1.3 in [13]. As far as we know, no results are known for 
General A;- ATSPP even for the case k = 2. 

One other problem related to one we consider is the following. We are given 2k distinct nodes 
S = {si, . . . , Sfc} and T = {ti, . . . , t^} in a symmetric metric. We want to find k paths whose union 
spans all nodes. This should be such that each node in S is the start node of exacly one path 
and each node in T is the end node of exactly one path. Matroid intersection techniques used by 
Rathinam and Sengupta [28] can be easily adapted to approximate this problem within a factor 2. 

1.2 Our Results 

By the directed triangle inequality, it is easy to see that there is an optimum solution for an instance 
of fc- ATSPP where each node in ^ — {s, t} lies on precisely one of the k paths. Such an optimum 
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solution corresponds to an integer point in LP ([T]) of the same cost. So the optimum value of LP 
([1]), say OPTlp, is a lower bound for the minimum cost A;-ATSPP solution. Our main result for 
/c-ATSPP is the following. 

Theorem 1.1 For any integer b > 1, there is an efficient algorithm for k-ATSPP that finds between 
k and A; + ^ paths of total cost at most 0(6 log n) • OPT^p. 

This is an 0(/clogn)-approximation using precisely k paths when b = k + 1 and an O(logn)- 
approximation using at most 2k paths when 6 = 1. 

The algorithm is also easy to implement with the most complicated subroutine being that of 
finding a minimum weight perfect matching in a bipartite graph. Its running time can easily be 
seen to be 0(M(n + k — 2)6 log n) where M(N) is the time it takes to compute such a matching in 
a complete bipartite graph with A'^ nodes on each side. 

We then proceed to study variants of /c-ATSPP that vary how the start and/or end locations 
are specified. Examples are when the start locations are not fixed or when we have a set of k start 
nodes S and a set of k end nodes T and the start and end locations of the paths should establish a 
bijection between S and T. We extend our approximation algorithm for A;-ATSPP to these variants. 

Finally, we study General /c-ATSPP. Our first result is an 0(log n) for General 2-ATSPP. We also 
have a 3-approximation for General /c-ATSPP in symmetric metrics and an 0(log n)-approximation 
for General /c-ATSPP when Sj = tj for all i. However, we have the following hardness result for 
General /c-ATSPP with no further restrictions. 

Theorem 1.2 It is NP-hard to distinguish between instances of General k-ATSPP whose optimum 
solution has cost and instances whose optimum solution has cost at least 1. 

This implies the problem cannot be efficiently approximated within any bounded ratio unless P = 
NP. While the reduction uses k = n/4, modifications can be made to prove hardness results (under 
stronger assumptions) for values of k being as small as polylogarithmic in n. 

To summarize, Section [2] presents the algorithm for /c-ATSPP, proves Theorem ll.H and discusses 
some variations of /c-ATSPP on how the start and/or end locations are specified. In Section [3] we 
demonstrate an 0(log n)-approximation for General 2-ATSPP, discuss approximation algorithms 
for other restrictions of General /c-ATSPP, and prove Theorem II. 2[ Section |4] then concludes this 
paper by identifying some directions for future work and mentioning some basic results for the 
alternative goal of minimizing the cost of the most expensive path in either /c-ATSPP or General 
/c-ATSPP. 

2 A Bicriteria Approximation for /c-ATSPP 

In this section, we will develop a bicriteria approximation algorithm that finds approximately k 
paths from s to t in an asymmetric metric G = {V^ A) whose total cost is within some bounded 
ratio of the optimum value of LP relaxation ([T]). The algorithm is parameterized by a positive 
integer 6; different bicriteria approximation guarantees result from different choices of 6. 

2.1 Preliminaries 

If X is a flow between two nodes or a circulation then we let X^v denote the value that X assignes 
to arc uv £ A. For S" C y we let X{5^{S)) = Eue^.i-eV-S ^™ ^'^'^ ^{^~{S)) = Y^vav s,u&s ^^u- 
For brevity, we let X{5+{v)) := X{5+{{v})) and x(6^{v)) := X{5-{{v})) for v£V. We'say X is 
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integral if X^v is an integer for each arc uv S A. The cost of X is X^^^g^c^™ • X^v All flows and 
circulations X in this paper will have X^v > for each arc uv £ A. 

Our starting point will be to use structures similar to cycle covers from |12j and path/cycle 
covers from [22] and [13]. 

Definition 2.1 A k-path/cycle cover from s to t is an integral flow F such that F{5^{v)) = 
F{6-{v)) = 1 for each veV- {s,t}, F{6+{s)) = F{6-{t)) = k, and F{6-{s)) = F{6+{t)) = 0. 

Note that in a /c-path/cycle cover F the flow Fuv across any arc uv € A — {st} is either or 1 and 
Fst < k. If we regard F as a multiset of arcs, then F may be decomposed into k paths from s to t 
and a collection of cycles where every v £ V — {s,t} lies on exacly one path or exactly one cycle. 
We can efficiently find a minimum-cost /c-path/cycle cover using a standard reduction to minimum 
weight perfect matching in a bipartite graph with n + k — 2 nodes on each side. 

LP ([1]) is the LP relaxation for fe-ATSPP we consider. It is similar to the LP relaxation for 
ATSPP considered in [26] and |13j . 

minimize : duvXuv (1) 

subject to : x{6~^ (v)) = x{5~ (v) = I yv£V-{s,t} 
x((5+(s)) = x{6-{t)) = k 
x{6-~{s)) = x{6+{t)) = 

x{6+{S))>l V{s}C5cy (2) 

Xuv > y uv £ A 

We will break the presentation of the algorithm into two parts. The first is a loop that runs 
for 0(61og?7-) iterations. Each iteration will find the cheapest /c-path/cycle cover on the remaining 
nodes and discards some nodes from the cycles or, more generally, circulations formed by taking 
the union of the current /c-path/cycle cover and paths from previous iterations. These discarded 
nodes can be added to the final solution by using the circulations to "graft" them into the paths. 
It is similar to the main loop in the algorithm for ATSPP in |13| . 

After this first phase we will have G(6A;logn) paths of cost from s to t whose union is acyclic 
and covers all remaining nodes. However, our goal is to use at most k + ^ paths. The second part 
of the algorithm will assemble only A; + ^ paths that cover all remaining nodes using arcs from the 
Q{bklogn) paths. This will be possible because we will carefully select which nodes to discard in 
each iteration so that each remaining node lies in almost a ^-fraction of the Q(bklogn) paths after 
the main loop. So, if we regard our @{bklogn) paths as a flow then each node supports almost a 
^-fraction of this flow. If we scale the flow by 0(6 log n) then we have a flow sending 0(A;) units 
from s to t where every other remaining node supports units of this flow. 

We will show how to round this flow to obtain an acyclic integral flow of roughly the same cost 
that sends between k and A; + | units of flow from s to t so that each remaining node supports 
exactly 1 unit of this flow. Then a path decomposition of this acyclic integral flow yields the 
required paths. Finally, we graft the nodes that were discarded in the first phase to these paths 
using the circulations that were removed in the first phase. 

2.2 The Algorithm 

Let 6 > 1 be an integer, this is the b in the statement of Theorem ll.il For notational convenience, 
we will let L be (6 + 1) [log2 n\ for the remainder of this section. Algorithm [1] is the fc- ATSPP 
bicriteria approximation. 
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Algorithm 1 The Bicriteria Approximation for A;-ATSPP. 



Let W \> W \s the set of nodes that have not been discarded yet 

Let ly = Q,\/v gV Used to decide which nodes to discard from a circulation 

Let ^ 0, Vn, v G V > F will be an acyclic s — t flow using only nodes in W 

Let Cuv ^ 0,\/uv gV > C will be a circulation on the nodes in — 

L^(6 + l)Llog2nJ 
for L iterations do 

Find a minimum-cost A;-path/cycle cover F' on W 

F ^ F + F' 

Subtract a circulation H from F so -F is acyclic again 
for each connected component A in the support of H do 

Let VA ^ argmin^gA^ii + H{5\{u)) > breaking ties arbitrarily 

for each node w G A — {va} do 

Shortcut the flow in F over w so F[5'^{w)) =0 \> F remains acyclic 

end for 

W ■^W — {A — {va}) > The nodes m. A — {va} will be reintroduced in step [22] 

+ H{5+{va)) 

end for 

C ^C + H 
end for 

Use Corollary 12. 101 to find k' G [k,k + ^] s — t paths whose union covers all nodes in W of total 
cost at most the cost of F. Let P be the s — t flow of value k' defined by these paths. 
Let B be the simple t — s flow with Bts = k' and B^v = for every other arc. 
LetC ^ P + C + B 

Find an Eulerian circuit K using each arc uv eactly C^^ times. i> See Lemma 

Shortcut K so it visits each node in V — {s,t} exactly once. 
Delete the k' edges ts from K to obtain k' paths Pi, ... , Pk'. 
Return Pi , . . . , P^,/ . 



The proof of the following lemma is simple and is found in Appendix I A. 1[ 

Lemma 2.2 In Step \2SX C is an integral circulation whose support is strongly connected in 
G{V,A). 

We assume, for now, that Step[20]works as stated. If so, we can prove the following combinatorial 
analog of Theorem 11.11 that compares the cost of the solution to the optimum /c-ATSPP cost OPT 
solution rather than the value of LP ([1]) . 

Theorem 2.3 Each node v G V — {s, t} lies on exactly one of the s — t paths Pi, ... , Pk' returned 
by AlgorithniUl and the total cost of these paths is at most L ■ OPT. 

Proof. Since C is a circulation whose support is strongly connected in G{V, A), then every node 
in V is visited by the Eulerian circuit. Since the shortcutting did not bypass around either s or 
t, then the ts arc still appear exactly k' times in K and there are no vs or tv arcs in K for any 
V G V — {s,t}. So, after removing the k' occurences of the t — s arc from K, we have a collection 
of precisely k' paths from s to t whose union visits all nodes. 

All that is left is to bound the final cost by L ■ OPT. We prove, by induction on i, that the 
cost of P + C after iteration i is at most i ■ OPT. For i = (before the first iteration) this is clear 
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and we now assume that i > and that the cost of F + C just before the i'th iteration is at most 
(i - 1) • OPT. 

Let Wi be the subset of nodes W in the i'th iteration. We can obtain a feasible A;-path/cycle 
cover on Wi of cost at most OPT by shortcutting an optimum fc-ATSPP solution on V around 
nodes in V — Wi. Thus, the minimum cost /c-path/cycle cover F' on Wi has cost at most OPT. 
After adding F' to F we have that the cost of -F + C is, by induction, at most i ■ OPT. The rest 
of the body of the loop simply moves flow between F and C and shortcuts some flow so the cost of 
F + C is still bound by i ■ OPT at the end of the i'th iteration. 

The cost of the circulation C in Step [22] is then at most L-OPT plus the cost of B. Shortcutting 
past nodes in the Eulerian circuit K yields an circuit whose total cost is still at most L ■ OPT plus 
the cost of B. The paths Pi, ... , P^' are formed by removing the k' arcs from t to s which have 
cost exactly the cost of B. Thus, the cost of the returned paths is at most L • OPT. □ 

2.3 Finding the Flow P in Step [20] 

Consider the acyclic integral flow F after the main loop terminates. It is possible to argue that our 
choice of va in Step [11] implies each v €W has F{5^{v)) > so a path decomposition of F yields 
a collection of paths whose union covers all nodes. However, the number of paths is kL which is 
much larger than our desired value k + ^. The fact that we can find fewer paths is essentially due 
to the fact that every v G W — {s,t} supports a lot of flow in F. 

The main object of concern in this step is the following polytope V{D) where I? G M. In V{D), 
we have a variable Zuv for every arc uv in the subgraph induced by W. The full description of 
V{D) is: 



z{6+ {w)) = z{6- (w)) = 1 yweW-{s,t} (3) 

zi6^{s)) = z{6-{t))= D (4) 

z{6-is)) = zi6^{t))= (5) 

< Zuv < Fuv V ordered pairs u,v G V (6) 



Since the support of F is acyclic and the support of z is required to be a subset of the support 
of F, then any integral point z S P{T)) corresponds to a flow P of the form required in Step [20l 
with k' = D. Notice that Constraints ([6]) imply that the cost of z would be at most the cost of F. 
Thus, our goal is to flnd a value D E [A:, /c + ^] for which V{D) has an integer point. The proof of 
the following property is standard {eg. [29j). 

Lemma 2.4 Every basic point in polytope V{D) is integral when D is an integer. 

So, to prove V{D) has an integer point for a given integer D it suffices to prove that V{D) 
contains any point. That is, for D € Z if there is some point z G P{D) with, perhaps, rational 
coordinates, then there is certainly a point z' with integer coordinates since V{D) is integral. 

The following lemmas are proven in essentially the same way as Lemmas 2.3 and 2.5 in the 
extended abstract of [13], so we omit their proofs. 

Lemma 2.5 Throughout the course of the algorithm, 1^ < [log2 n\ for every v € W . 

Lemma 2.6 When the main loop terminates, F[5~^{v)) = L — 1^ for each v £W — {s,t}. 

We will require that all node in W — {s,t} support the same amount of flow in F. The following 
lemma shows how to construct such a flow through simple modifications of F. 



6 



Lemma 2.7 For some < 7 < [log2 there is an acyclic integral flow F' sending kL units of 
flow from s to t where every v &W — {s,t} has F'{6'^{v)) = L — 7. Furthermore, the cost of F' is 
at most the cost of F. 

Proof. Let 7 = max^^\y_^g ^'^ 1^ and recall that 7 < [log2 n\ by Lemma 12.51 While some 
w G W — {s,t} has F{5^{v)) > L — 7 (c/. Lemma [2.6p . shortcut F past w as follows. Choose 
any two arcs uw,wv with Fuw,Fyjy > 0. Then subtract 1 from both Fuw and Fy^^ and add 1 to 
Fuv Note that F remains integral and acyclic after such an operation and that the cost of F does 
not increase by the triangle inequality. We let F' be this flow F once all u; G — {s, t} have 
F((5+(u)) = L-7. □ 
From now on, we will assume that F[5~^{v)) = L — 7 for every v £ W — {s, t} where 7 is some 
integer at most log2 n. The following lemma is the first step to finding a good integer D for which 
V{D) 7^ 0. The value may be fractional, but we will deal with that problem later. 

Lemma 2.8 V (r^) / for some < 7 < [log^ n\ . 

Proof. Let D = and define a point z by Zuv = "f^- It is easy to verify z G 'P(-D) after 
noting that 7 < [log2 n\ and 6 > 1 implies L — 7 > 1. □ 
The main problem is that the may not be an integer. The following lemma fixes this by 
mapping a point in V{D) to a point in 'P([-DJ) be modifying flow along "sawtooth" s — t paths 
that alternate between following arcs in the support of z in the forward and reverse directions. 
This can be used to decrease both z{6~^{s)) and z{6~ (t)) while preserving z{d^{v)) = z{S~{v)) = 1 
at all other nodes. The fact that such a sawtooth path always exists if D is not an integer is a 
consequence of the fact that the total flow through all nodes in W — {s,t} is integral. The full 
proof appears in Appendix IA.21 

Lemma 2.9 IfV{D) / 0, then V{[D\) / 0. 

Corollary 2.10 There is an integer k' G [k,k + \] such that V{k') / 0. That is, we we can find 
between k and k + ^ paths from s to t whose union spans all nodes in W . Furthermore, the cost of 
these paths is at most the cost of the flow F after the main loop of AlgorithmUl 



Proof. Lemmas 12.81 and 12.91 show V 
since 

k < 



kL 

L-7 



^ for some < 7 < [log2jn. The result follows 



kL 



L-7. 



^ fc(& + l)Llog2nJ ^j^^ ^ k 
6[log2nJ b' 



□ 

In the next section, we complete the proof of Theorem 11.11 by showing the cost is actually at 
most 0(61ogn) times the value of LP relaxation ([T]). 



2.4 Bounding the Integrality Gap 

For a subset W '^V containing both s and t, let LP{W) denote LP relaxation ([1]) on the asymmetric 
metric induced by W. Consider the LP obtained by removing Constraints ([2]) from LP{W). The 
resulting LP is integral [29] whose integer points correspond to /c-path/cycle covers of W. The 
following holds because removing the constraints from a minimizaton LP does not increase its 
value. 
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Lemma 2.11 For any subset W V containing s and t, the minimum cost of a k-path/cycle 
cover ofW is at most the value of LP{W). 

The proof of the next lemma is the same as the proof for the case A; = 1 in [13] . It uses sphtting 
off techniques developed by Frank [TT] and Jackson [TO] for Eulerian graphs. The idea is that we 
obtain an Eulerian graph by multiplying an optimum basic (thus fractional) point in LP ([1]) by a 
large enough integer and identifying s and t. Since we are only concerned with preserving cuts for 
subsets S of W that include s, then using splitting off techniques to bypass nodes in V — {s, t} in 
this Eulerian graph and then scaling back to a fractional point x' will preserve x'{6^{S)) > 1 in 
the graph induced by W while not increasing the cost. 

Lemma 2.12 For any subset WofV containing s and t, the value of LP(W) is at most the value 
OPTlp ofLP{V). 

Now we can complete the proof of Theorem 11.11 

Proof of Theorem 11.11 By Lemmas 12.111 and 12.121 the cost of each /c-path /cycle cover found in 
Step ((Tj) is at most OPTlp . The proof of Theorem 12.31 shows that the final cost of the paths is at 
most the sum of the costs of each /c-path/cycle cover found. So, the total cost of the k + ^ paths 
found is at most L ■ OPTlp = 0(6 log n) • OPTlp- □ 

2.5 Varying the Endpoints 

Consider the following ways to specify the start locations of the paths: each path may start at 
any node (No Source), all paths start at a common node s (Common Source), or there are nodes 
si, . . . , Sfc where each must be the start of some path (Multiple Source). We can also consider anal- 
ogous ways to specify the end locations of the paths. In Multiple Source/Multiple Sink instances, 
we only require each path start at some Sj and end at some tj. It may be that some paths start 
and end at locations with different indices. 

The following theorems are easy to verify and the proofs are only briefly sketched. We let OPT 
be the cost of the optimum solution using exactly k paths for the A;-ATSPP variant in question. 

Theorem 2.13 For any integer h > 1, there is an approximation algorithm for the No Source/Single 
Sink variant of k-ATSPP that finds at most A: + | paths of total cost at most 0{blogn) ■ OPT. 

Proof. Simply add a new start node s and set sv = and vs = oo for every v £ V. Then use 
the approximation algorithm from Theorem II. 1[ □ 

Theorem 2.14 For any integer b > 1, there is an approximation algorithm for the Multiple 
Source/Single Sink variant of k-ATSPP that finds at most k -\- ^ paths of total cost at most 
0{blog{n + k)) -OPT. 

Proof. Let si, . . . , be the multiple sources. Create a start node s and k other new nodes 
s'^, . . . , s'f^. Add cost arcs from s to s'- and from s'- to Si for each i. Add a cost oo arc from v to 
s for every v £ V. Use Theorem 11.11 on the shortest paths metric of this graph. The intermediate 
nodes s'^ are to ensure that each Si is the start location of some path. □ 

Combining the constructions in Theorems 12.131 and 12.141 can also be used to combine different 
start and end location specifications {e.g. No Source/Multiple Sink). 
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3 General fc-ATSPP 



Recall that in General fc-ATSPP, we are given k pairs of nodes (si,ti), . . . , (sk,tk)- The goal is to 
find an Si — ti path for each 1 < i < A; so that every v £ V lies on at least one such path. This 
differs from the Multiple Source/Multiple Sink variant described in Section [2.51 since the path at Sj 
must end at ti, rather than merely requiring that the start and end nodes of the paths establish a 
bijection between {si, . . . , Sk} and {ti, . . . , tk}- 

3.1 Approximating General 2-ATSPP 

Let (si,ti) and (s2;^2) be pairs of nodes we are to connect. Furthermore, suppose all four of these 
endpoints are distinct (by creating multiple copies of locations if necessary). This means there is 
an optimum General 2-ATSPP solution where the two paths are vertex disjoint. 

Notice that the optimum Multiple Source/Multiple Sink 2-ATSPP solution for the case with 
sources {si,S2} and sinks {ti,t2} is a lower bound for OPT. The first step of the algorithm is to 
run an a-approximation for this variant of 2-ATSPP. This gives us two paths Pi,P2 starting at 
si,S2, respectively. If Pi ends at ti (equivalently, P2 ends at t2), then these paths form a valid 
solution for the General 2-ATSPP problem with cost at most a • OPT. Otherwise, we have si — t2 
and S2 — ti paths. We use the following lemma which is proven in Appendix IA.31 It gives us a 
relatively cheap way to adjust these paths to get si — ti and S2 — ^2 paths. 

Lemma 3.1 There are nodes ui,V2 on Pi and vi, U2 on P2 such that the following hold: a) ui = f 2 
or U1V2 is an arc on Pi, b) vi = U2 or V1U2 is an arc on P2 and c) du^u2 + dv^v2 ^ OPT. 

The rest of the General 2-ATSPP algorithm proceeds as follows. Try all O(n^) guesses for 
ui,U2,vi, V2 where either ui = V2 or U1V2 is an arc on Pi and either vi = U2 or V1U2 is an arc on 
P2. For each guess, construct a path Qi by traveling along Pi from si to ui, then using the nin2 
arc in G, and then traveling along P2 from U2 to ti. Construct Q2 in a similar manner by traveling 
from S2 to -^i on P2, then using the ^1^2 arc in G, and then traveling along Pi from V2 to t2- It is 
easy to see that Qi and Q2 form a feasible solution for this General 2-ATSPP instance. Since each 
arc on Pi and P2 is traversed at most once by Qi and Q2, then the total cost of Qi and Q2 is at 
most Q • OPT + duiu2 + dviv2- Output the cheapest solution found over these guesses. 

When the algorithm guesses nodes ui,U2, vi,V2 from Lemma [3T] we have d^j^j + d^^^jj < OPT. 
It is straightforward to verify that each arc in Pi and P2 is used at most once between Qi and Q2. 
So the final cost of Qi and Q2 is at most (a + 1) ■ OPT. To summarize: 

Theorem 3.2 // there is an a-approximation for the Multiple Source/Multiple Sink variant of 
2-ATSPP then there is an (q -|- 1) -approximation for General 2-ATSPP. 

By setting 6 = 3 and using Theorem 12.141 composed with an analogous result for Multiple Sink 
instances. Theorem 13.21 gives us the following. 

Corollary 3.3 There is an O {log n)- approximation for General 2-ATSPP. 

3.2 Approximating Other Restrictions of General A;-ATSPP 

We breifly mention a couple of variants of General /c-ATSPP that can be approximated well. We 
leave their full descriptions to App endix I A . 41 since they are simple variants of known algorithms for 
some TSP variants. 



9 



u 




i) ii) 



Figure 2: i) An instance of tripartite triangle packing with n = 2. ii) The corresponding graph H. 

The first variant is when the metric is symmetric. In this case, there is a simple 3-approximation 
using a tree doubling approach. Next, if Si = ti for each 1 < i < /c, then the problem is to find a 
cycle cover where each cycle contains some root node Si where we allow cost loops on the nodes 
Si (corresponding to the salesman at Sj going directly to ti). This version can be approximated 
within [log2(?T- — k)\ + 1 using a modification of the ATSP algorithm by Frieze et al. \12\ . 

3.3 Hardness of General fc-ATSPP 

We will use the following NP-complete. 

Definition 3.4 In the Tripartite Triangle Packing problem, we are given a tripartite graph G = 
{U L) V D W, E) with \U\ = \V\ = \W\ = n where no edge in E has both endpoints in a common 
partition U,V, or W . A triangle is a subset of 3 nodes for which any two are adjacent in G. The 
problem is to determine if it is possible to find n vertex- disjoint triangles in G. 

NP-completeness of this problem is essentially shown in |14j . Technically, only related problems 
are proven to be NP-complete in this book. However, consider the 3D Matching problem that is 
proven to be NP-complete in Theorem 3.2 on page 50. If each triple M = {u, v, w} is replaced with 
edges uv, vw, wu then we obtain an instance of Tripartite Triangle Packing. Careful inspection of 
the proof of Theorem 3.2 of |14j shows that the only triangles {uv, vw, wu} that can be formed must 
have come from some triple M = {u, v, w} in the 3D Matching reduction. Thus, the Tripartite 
Triangle Packing problem is also NP-complete. 

For our reduction to General fe-ATSPP, let G = {U D V L) W, E) be an instance of Tripartite 
Triangle Packing with \U\ = \V\ = \W\ = n. Create a directed graph H with four layers of nodes 
Xi,X2,X3,X4 where Xi and X4 are disjoint copies of U, X2 is a copy of V, and X3 is a copy of 
W. For every edge e in G, there is a unique index 1 < i < 3 such that the endpoints of e lie in Xi 
and Xi^i. Add this arc to H, direct it from Xi to Xi^i, and set its cost to 0. This is illustrated in 
Figure [2j Set k := n and consider the General /c-ATSPP instance on H' obtained from the shortest 
paths metric H where we set the cost of a uv arc to be 1 if there is no ti — w path in H. For each 
u £ U, we have a source/sink pair from the copy of u in Xi to the copy of u in X4. 

The details of the following claim are simple and can be found in Appendix IA.5I The proof of 
Theorem 11.21 immediately follows. 
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Claim 3.5 There is a Tripartite Triangle Packing solution in G if and only if the optimum General 
k-ATSPP solution in H' solution has cost 0. 

Note that the value k is n/4 in the above reduction. Similar hardness results for smaller values 
of k (as a function of n) are established in Appendix IA.6I down to k being polylogarithmic in n. 
The complexity of approximating the case when is a constant at least 3 remains open. 

4 Future Directions 

Our best approximation for /c-ATSPP that uses exactly k paths has an approximation guarantee 
of 0{klogn). Can the dependence on k in the approximation ratio be reduced? Perhaps there is 
an 0(polylog(n, A:))-approximation for A;-ATSPP that uses only k paths. On the other hand, the 
problem might be hard to approximate much better than k. Also, as far as we know the integrality 
gap of LP (m could be n{k). 

For General /c-ATSPP, the case A; = 1 is simply ATSPP and we proved an 0(log n)-approximation 
for k = 2. Is there a more general 0{f{k) ■ log n)-approximation for General A;- ATSPP whose run- 
ning time is polynomial when /c is a constant? 

Finally, rather than minimizing the total cost of all paths we might want to minimize the 
cost of the most expensive path. This can be thought of as minimizing the total time it takes 
for agents moving simultaneously to visit all locations. From Theorem 11.11 and the observation 
that the total cost of k paths is at most k times the cost of the most expensive path, we get 
an 0(6/c log n)-approximation for this variant that uses at most k + ^ paths. Also, Theorems 
IA.5I and IA.6I (Appendix IA.7P show that approximation algorithms for Directed Orienteering and 
Directed /c-Stroll can be used to obtain other bicriteria approximation algorithms. The current 
best approximation algorithms for Directed Orienteering [71 [25] and Directed fc-Stroll [1] imply the 
following: 

Corollary 4.1 There is an efficient algorithm that finds 0{klog^ n) paths from s to t of maximum 
cost OPT whose union covers all nodes in V . 

Corollary 4.2 There is an O {log^ n)- approximation uses at most 0{klogn) paths. 

We leave it as an open problem to improve these bounds. In particular, is it possible to obtain a 
polylogarithmic approximation that uses only 0{k) paths? We also note that the hardness results 
for General /c- ATSPP proven in this paper also hold for the the variant where we want to minimize 
the maximum cost of the Si — ti paths. 
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A Appendix 

A.l Proof of Lemma 12.21 

We see that C is an integral circulation since it is the sum of an integral circulation C, an 
integral s — t flow P of value k' , and an integral t — s flow B of value k' . To prove its support is 
strongly connected in G{V, A) it suffices to show that there is a — t path in the support of C for 
every v £ V. Suppose Wi is the set of remaining nodes W just after the i'th iteration of the loop 
(and Wq = V). We prove by induction on L — i that every node in Wi has a path to v. 

Consider the base case i = L. Since P only uses arcs in the support of F, since F is acyclic 
(from Step [9]), and since F{6~^{v)) > for v G W — {s,t} (Lemmas 12.51 and 12. 6p then any walk from 
any v G Wl in the support of P of maximal length must end at t. 



13 



Now suppose i < L and that every node in Wj+i has a path to t in the support of C . Let v S Wi, 
if w G VFj+i then, by induction, v has a path to i. Otherwise, v must have been removed in Step 
[TSlof the current iteration for some circulation A. But then va, the representative of circulation A 
that was chosen to remain in W, is in VFj+i, so it has a path to t by induction. Finally, v also has 
a path to t in the support of C since it can travel first to va in A and then, by induction, to t. □ 

A. 2 Proof of Lemma 12.91 

Suppose D is not an integer, otherwise we are already done. Let z be any point in V{D). Form an 
undirected and weighted bipartite graph H = (Wl U Wr, E') where both Wl and Wr are disjoint 
copies of W . We identify an arc uv of G with an edge uv in H with u € Wl and v G Wr. That is, 
for each arc uv add an edge from u € Wl to w G Wr with weight Zuv By constraints ([3]), dH and 
([U, we have that z{5{v)) (the total value of all edges in E' incident to v) is an integer for every 

V £ Wl U Wr except for s € Wl and t G Wr. 

We claim there is a path from s E Wl to t £ Wr in that only uses edges uv with Zm, > 0. 
Note that these paths are allowed to take a step from Wr to Wl since is undirected. Such a 
step corresponds to following an arc uv in the reverse direction in the original directed graph G. 

Suppose, for the sake of contradiction, that there is no path from s 6 Wl to t G Wr using only 
edges uv with Zuv > 0. Let S be the collection of all nodes in H that can be reached from s S Wl 
using only edges with positive z-value; our assumptions means the copy of t in Wr is not in S. 

Let z{E{S)) denote the total z-value of edges with both endpoints in S. On one hand, since 
every node v S Wr — {t} has z{5{v)) = 1 and since t ^ S, then 

z{E{S))= Yl z{5{v)) = \WRnS\. 
On the other hand, since every node v S Wl — {s} has z{6{v)) = 1 and since s £ S, then 

z{E{S))= J2 my)) = \{WL-{s})ns\ + z{5{s)) = \{WL-{s})ns\ + D. 

But |i? n SI = \{Wl — {s}) n S\ + D contradicts our assumption that D is not an integer. So, it 
must be that there is a path from s € Wl to t £ Wr in H using only edges e with Ze > 0. 

Suppose that such a path followed a sequence of edges ei, 62, • • • , ec. Since H is bipartite and 
s £ WL,t £ Wr are on different sides, then c must be odd. Let 

6 = min { D — [D\ , min z^, . > 

l<j<c: i odd ' J 

be the minimum z-value of the edges that are followed from Wl to Wr when walking along this 
path (or the difference between D and \_D\ if this is smaller). Update the z values of the edges on 
this path by: 

( Ze — 5 i odd 
[ Ze^ + I even 

We will now argue that the resulting z- values fit in the polytope V{D — S). First, notice that 
both z{6{s)),s £ Wl and z{5(t)),t £ Wr, which were originally D, decrease by exactly 6. Any 
other node v not on this path does not have the z-value of any incident edge changed. Finally, if 

V is an internal node on this path then the z-value of one incident edge decreases by 6 and the 
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z-value of another incident edge increases by 5. Therefore, we have z{5{v)) = 1 after this update 
for every internal node v on this path. 

By our choice of 5, we maintain Ze > for every edge e. Now, if the path was a single edge st 
then no edge had its z-value increased so the bounds Ze < continue to hold for every edge e. 
Otherwise, every edge e = uv on this path has either z{6{u)) = 1 or z{5{v)) = 1 so Zg < 1 must 
hold after this update. Since the only Ze values that are increased are those in the support of the 
integral flow F, then Ze < 1 < Fg. 

The above process maps a point from V{D) to a point m.V{D — 5) when D is not an integer. 
If 5 was chosen to be — [DJ > 0, then z{5{s)) = z{5{t)) = [DJ after this process and we are 
done. Otherwise, we can repeat the process to map a point from V{D — 5) to V{D — 5') for some 
6' > 6 with D — 6' > [D\ and so on. Each such step that does not result in a point in the polytope 
V{[D\ ) has us remove at least one edge from the support of z. Since no edges are introduced to the 
support of z, then this process will terminate in finitely many iterations with a point in VdDl). □ 

A. 3 Proof of Lemma 13.11 

Let P[ be the si — ti path and be the S2 — ^2 path in some fixed optimum solution. Also, 
for nodes u,v € V we use u ^ v to indicate that both u and v appear on the same path in our 
optimum solution P[, P2 and that u appears sometime before v on this path. 

Let a, h be such that ab is the first arc on Pi with a G and 6 G P2. Such an arc exists because 
Pi starts with a node in P[ and ends in a node in P!^. Let x be the first node along P2 such that 
a < X. We know such a node exists because a -< ti and ti G P2. Now, let y be the furthest node 
along P2 but still before x such that either y G P{ or y G P2 and y ^ b. Again, such a node exists 
because S2 G P2 and S2 -< b. 

Suppose y G P2 with y < b. If y is immediately followed by x on P2 then we let ui = a,U2 = 
x,vi = y,V2 = b. Otherwise, say w is the immediate successor of y on P2. By our choice of y we 
have w E P2 and b < w. In this case, we set ui = b,U2 = w, vi = y,V2 = b. This case is illustrated 
in Figure [3l 

Next, suppose y G P[. Let z be the first node on P2 that lies on P[. Note that z occurs no later 
than y on P2 (it may be equal to y). Now, by our choice of a on Pi, every node between si and 
a on Pi also lies on P[. We also have that si appears before z on P[ (since si is the start of Pi) 
and, by our choice of x and the fact that z appears before x on P2, we have that z appears before 
a on P[. So, there must be some arc cd on the subpath of Pi starting at si and ending at a such 
that c appears before z on P[ and d appears after z on P{. We let ui = c,U2 = z, vi = z,V2 = d in 
this case. 

In all cases, we can easily verify that condition a) and b) in the statement of the lemma are 
satisfied. Also, in any case we have that one of the following is true: 

1. ui < U2 and vi -< V2 with ui and vi appearing on different paths in P[,P2 

2. Vi < V2 = Ui ^ U2 

3. Ui -< ti2 = f 1 -< V2 

In each case, the triangle inequality implies duj^u2 + d^^y^ is a lower bound on the total cost of P[ 
and P2. That is, dujU2 + dviV2 ^ OPT. □ 

A. 4 Restricted Instances of General /c-ATSPP 

Theorem A.l There is a 3 -approximation for General k-ATSPP in symmetric metrics. 
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Figure 3: i) An illustration of the case y G P2 with immediate successor w ^ x on The white 
nodes lie on P{ and the gray nodes lie on Pg- ^ dashed arc v ^ v' indicates f -< f ' on P[ and a 
gray arc v v' indicates v ~< v' on ii) The si — ti and S2 — ^2 paths Qi, Q2 formed in the proof 
of Theorem 13.21 for this case. 

Proof. First, we may assume that the 2k start and end locations are all distinct by duplicating 
locations that appear multiple times among si, . . . , Sfc, ti, . . . , t^. Let X = {si, . . . , Sk}U{ti, . . . , tfc}. 

Compute the minimum weight forest F oi G where every component contains exactly one node 
in X. Such forests correspond to spanning trees of G/X, the graph obtained by contracting all 
nodes in X and keeping parallel edges. Now, add the set of edges M = {siti, 1 < i < k} to F. 
This results in a forest where each component contains both Sj and ti from some 1 < i < A; pair. 
In each such component Fj, double all edges except Siti and find an Eulerian walk from Si to U. 
Follow such a walk and shortcut past repeated nodes to get a path Pi that visits all nodes in the 
component Fi. 

Before adding M, the cost of F was at most OPT because a forest where each component 
contains exactly one node in X can be obtained by deleting one edge from every Si — ti path in the 
optimum solution. Also, since the optimum solution consists of Si — ti paths for each 1 < i < k 
and since the single edge Sjtj is the shortest Sj — ti path, then the cost of M is also at most OPT. 
Finally, since we only doubled the edges from F and shortcutting does not increase the cost of a 
path, then the final cost is at most twice the cost of F plus the cost of M which, in turn, is at most 



Theorem A. 2 There is a {[log2{n — k)\ + 1) -approximation for instances of General k-ATSPP in 
asymmetric metrics when Sj = ti for all 1 < i < k. 

Proof. Let denote the common start and end location for salesman i. Recall that we are 
allowing a salesman to travel directly from Si to ti which corresponds to a loop of cost at r,. Like 
Frieze et al. [12], we consider cycle covers except that we also allow a loop at a root to be a cycle 
in the cycle cover. We do not allow loops at any node in y — {ri, . . . , r^}. A minimum cost cycle 
cover that allows loops at root nodes can be computed efficiently in a manner similar to computing 



3 • OPT. 



□ 
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minimum cost cycle covers without loops. 

Initialize the set of "remaining nodes" W := V. While W 7^ {ri,...,rfc}, we compute a 
minimum cost cycle cover C. Note that the cost of C is at most OPT because shortcutting an 
optimum solution past nodes in 1/ — VF yields a feasible cycle cover on W of cost at most OPT. If 
any cycle in C contains more than one root, then shortcut C past all but one of these roots. The 
roots that are removed from the cycle are then said to be covered by a loop of cost 0. For each 
cycle Ci in C, if there is a root r in Q then we remove all of Ci — {r} from W. Otherwise, we only 
remove all but any one arbitrarily chosen node in Ci from W. 

When W is finally reduced to R, the union of all cycles found over all iterations will have 
each component being an Eulerian graph containing exactly one root. So, shortcutting an Eulerian 
circuit in each component yields the desired cycles. At each step before the final iteration, at least 
half of the non-root nodes are removed so the number of iterations is at most [log2(n — k)\ + 1. 
The total cost of these cycles is at most the total cost of all cycles covers each of which has cost at 
most OPT, so the total cost is ([logaCn - k)\ + 1) • OPT. □ 

A. 5 Proof of Claim [331 

Suppose T = {{ui,Vi, Wi)}l''^i is a collection of n vertex-disjoint triangles with Ui £ U,Vi £ V and 
Wi S W. For each such triangle {ui,Vi,Wi), we have the salesman starting at Ui E Xi to travel first 
to Vi € X2, then to Wi G X3, and finally to Ui G X4. Every node in H' is visited since every node is 
included in some triangle in T. By construction, every step taken by a salesman traverses an arc 
of cost so the total cost is 0. 

Conversely, suppose there is a /c-ATSPP solution that avoids using any arc of positive cost. 
Then each of the arcs followed by the salesmen must be in increasing order of the levels Xi . Since 
there are only n salesmen and since each salesman can visit only two nodes in addition to their 
endpoints, then each salesman visits every layer X^. Since no arcs of positive cost were used then 
every salesman only uses arcs corresponding to edges in G. Thus, the nodes visited by each salesman 
correspond to a triangle in G and these triangles partition the nodes of G. □ 

A. 6 Hardness of General /c-ATSPP for Smaller k 

There is a very simple modification to the proof of Theorem 11.21 that proves hardness for smaller 
values of fc. It is reminiscent of some "padding" arguments in complexity theory. 

Theorem A. 3 For any constant e > 0, instances of general k-ATSPP with k = 0,{n'') cannot be 
approximated within any bounded ratio unless P = NP. 

Proof (sketch). Simply attach a path P of length Q{n'^) to the end of the copy of ui € X4 
in H in the reduction from Theorem 11.21 Set the cost of all arcs in this path is 0. All start and 
end locations of the salesmen are the same, except that the first salesman ends at the end of the 
path P rather than at ui £ X4. Now, k = @{N^) where is the number of nodes in this modified 
version of H. □ 
In fact, under the Exponential Time Hypothesis we can push the value of k down to polyloga- 
rithmic in the hardness reduction. 

Theorem A. 4 There is a constant c such that for any 5 > 0, no polynomial-time algorithm for 
instances of General k-ATSPP on N nodes with k = r2(log^ A^) can discern between instances with 
optimum value and instances with optimum value at least 1 unless 3SAT has can he decided in 
time n is the number of variables. 
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Proof. Say the reduction from 3SAT to Tripartite Triangle Packing produces instances of size 
at most rf for large enough n where c is a constant and n is the number of variables in the 3SAT 
instance. Let k denote the size of a partition in the Tripartite Triangle Packing instance and note 
k < n^. As in the proof of Theorem lA. 31 append a path to some ui S X4 except make its length 

2^" —Ak. Note that k = (logg N)s where N = 2^" is the number of nodes in this General /c-ATSPP 
instance. 

Suppose there is a polynomial-time algorithm for General fc-ATSPP instances on nodes that 
can distinguish between instances with optimum value and instances with optimum value at least 
1 when k > (log2 N) ^ . Use this algorithm on the instance constructed in the previous paragraph 
to distinguish between "yes" and "no" instances of 3SAT. 

The running time of the reduction of 3SAT to General /c-ATSPP is polynomial in 2'^'' < 2" . 

s 

Running the polynomial-time algorithm for General A;-ATSPP on the instance with N = 2^" nodes 

o( n( ^\ 
yields an algorithm for 3SAT whose running time is 2 V / which is at most 2 v" / . □ 

A. 7 Minimizing the Salesmen's Makespan 

Here we consider the variant of fc-ATSPP where the goal is to minimize the cost of the most 
expensive path rather than the total cost of all paths. Let OPT denote the minimum value for 
which there are k paths from s to t of maximum length OPT whose union spans all nodes. We 
first recall that in the Directed Orienteering problem, we have an asymmetric metric G = {V, A) 
with arc weights, a budget B, and a fixed start node s. The goal is to visit the maximum number 
of nodes with a path starting at s of cost at most B. 

Theorem A. 5 If there is an a -approximation for Directed Orienteering, then we can efficiently 
find 0{kalogn) paths from s to t, each of length 2 ■ OPT, whose union spans all nodes. If we have 
an a- approximation for the variant of Directed Orienteering when both start and end location are 
specified, then we can find 0{ka\ogn) paths of length at most OPT. 

Proof. We will describe an algorithm that is supplied with a value B. If B > OPT, then it will 
succeed in finding 0(A;alogn) paths of length at most B + OPT whose union covers all nodes. So, 
by binary searching for B and keeping the smallest value for which the algorithm succeeds, we find 
0(A;Q;logn) paths of length at most 2 • OPT. 

The algorithm is simple. Initialize W = V — Then, for Q{ka\ogn) iterations use the 

Directed Orienteering approximation on U {s} starting from node s to find a path P of length 
at most B that covers some nodes. Append t to the end of this path P. Remove the nodes of 
P — {s, t} from W and repeat. Note that each path has length at most B + OPT because the final 
step to t has length at most OPT. liW = ^ after all iterations, then we say that the algorithm 
succeeded. Otherwise, we say the algorithm failed. 

We claim that the algorithm will succeed '\i B > OPT. In each iteration we have, from short- 
cutting the paths in an optimum solution around nodes in y — W , that there is a path of length 
OPT < B that covers at least |Ty|/fc nodes in W. So, the Directed Orienteering algorithm with 
budget B will cover at least \W\/ka of these nodes and the size of W drops by a (l — ^)-fraction. 
After Q{kalogn) iterations all nodes should be covered. 

Finally, we note that we can use essentially the same algorithm to find paths of length OPT if we 
have a a-approximation for Directed Orienteering when we can specify both start and end locations. 
That is, we don't have to extend the paths found by the Directed Orienteering approximation to 
end at t since they already end at i. □ 
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In the Directed /c-Stroll problem, we have an asymmetric metric G = {V, A), an integer k, and 
two nodes s,t. The goal is to find an s — t path of minimum weight that visits at least k other 
nodes. We can also use approximation algorithms for this problem to approximate the variant of 
/c-ATSPP whose goal is to minimize the cost of the most expensive path. 

Theorem A. 6 If there is an a- approximation for the Directed k-Stroll problem, then we can find 
0{k\ogn) paths of length at most a ■ OPT each. 

Proof. Initialize W to V — {s, t} and iterate the following proceduce. Use the Directed 
StroU approximation on II^U{s,t} to find an s — t path P that visits at least |VF|/A: locations in W. 
By shortcutting an optimum solution, we see that there is such a path on the subgraph induced 
by W of length at most OPT so the length of P will be at most a ■ OPT. Remove the nodes in 
P — {s,t} from W and repeat. After 0(A;logn) iterations, all nodes will be covered by paths of 
length at most a ■ OPT. □ 
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